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Abstract 

Genetic and biochemical evidence suggests that X Orf is a recombination mediator, promoting nucleation of either bacterial 
RecA or phage Red/? recombinases onto single-stranded DNA (ssDNA) bound by SSB protein. We have identified a diverse 
family of Orf proteins that includes representatives implicated in DNA base flipping and those fused to an HNH 
endonuclease domain. To confirm a functional relationship with the Orf family, a distantly-related homolog, YbcN, from 
Escherichia coli cryptic prophage DLP12 was purified and characterized. As with its X relative, YbcN showed a preference for 
binding ssDNA over duplex. Neither Orf nor YbcN displayed a significant preference for duplex DNA containing mismatches 
or 1-3 nucleotide bulges. YbcN also bound E. coli SSB, although unlike Orf, it failed to associate with an SSB mutant lacking 
the flexible C-terminal tail involved in coordinating heterologous protein-protein interactions. Residues conserved in the Orf 
family that flank the central cavity in the X Orf crystal structure were targeted for mutagenesis to help determine the mode 
of DNA binding. Several of these mutant proteins showed significant defects in DNA binding consistent with the central 
aperture being important for substrate recognition. The widespread conservation of Orf-like proteins highlights the 
importance of targeting SSB coated ssDNA during lambdoid phage recombination. 
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Introduction 

Recombination in bacteriophages salvages genomes for pack- 
aging by restoring damaged or broken molecules via exonuclease 
processing and annealing. Illegitimate exchanges promote rapid 
evolution as new gene combinations or acquisitions can be 
generated during joint formation. In phage X the Red system is 
responsible for exchanges at DNA ends [1]. Redy protein serves to 
inhibit the Escherichia coli RecBCD exonuclease, ensuring that 
rolling circle replication can proceed [2,3]. X exonuclease (Redot) 
degrades the DNA duplex to generate 3' ssDNA tails, which are 
bound by Red/? protein, a strand annealing protein that searches 
for homologous ssDNA sequences [1,4]. The combined action of 
Reda and Red/? would be expected to favor splice-type 
recombinants, although there is evidence to indicate that 
annealing events occur regularly in the context of exposed ssDNA 
at a replication fork [5-7]. Thus, DNA synthesis primed by the 3' 
annealed strand or by template switching can provide a means of 
generating intact genomes suitable for incorporation into capsids. 



Phage X encodes an ancillary recombinase called Orf, which 
was identified by its ability to complement the recombination 
deficiencies apparent in recF, recO and recR mutants of E. coli 
[8,9]. The products of recF, recO and recR [10] function 
predominantly at ssDNA gaps and aid assembly of the primary 
bacterial recombinase, RecA, helping to overcome the inhibitory 
effects of ssDNA binding protein (SSB). Further in vivo studies 
revealed that the susceptibility of recFOR mutants to ultraviolet 
light could also be overcome by Orf [1 1] . Orf is a 1 7 kDa protein 
that behaves as a dimer in solution and interacts with E. coli SSB 
protein. It binds equally well to ssDNA, gapped duplex and 5' and 
3' flap DNA and less well to fully duplex substrates [12]. The Orf 
crystal structure revealed an asymmetrical toroid with a central 
tunnel flanked by positively charged residues [12,13]. Although 
the channel is too narrow to accommodate dsDNA, it could 
potentially encompass ssDNA. One side of the central cavity 
comprises a RAGNYA motif found in a wide range of DNA 
binding proteins [14]. DNA binding could also occur along a 
shallow groove across the protein surface since Orf bound equally 
well to a ssDNA flanked by duplexes, which would prohibit 
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threading of ssDNA through the central hole [12]. Residues in the 
flanking C-terminal helices also contribute to stabilizing the 
association with DNA [13]. 

In this study we describe a family of phage and prophage Orf 
proteins exhibiting considerable sequence diversity but sharing 
common ancestry within a core domain. To substantiate their 
functional relatedness we purified one of the most dissimilar 
members, YbcN, from the E. coli cryptic prophage DLP12 and 
compared its properties with its counterpart from phage X. YbcN 
was previously identified as a T:G mismatch and base-flipping 
enzyme [15], an activity that is not easily reconciled with a role in 
suppressing the genetic recombination and UV light repair defects 
of recF, recO and recR mutants. 

YbcN behaved remarkably like X Orf in its preference for 
binding ssDNA and in its association with E. coli SSB protein, 
supporting a functional relationship between the two phage 
proteins. However, some differences were noted in their interac- 
tions with DNA and SSB. YbcN bound much less well than Orf to 
ssDNA, although both proteins did show improved binding to 
bubble DNA structures. In addition, YbcN showed a specific 
interaction with the C-terminus of SSB, a feature not shared by X 
Orf. Neither Orf nor YbcN exhibited an enhanced affinity for 
DNA containing a single G:G mismatch over fully duplex DNA, 
although Orf showed a slight preference for a T:G mispair. A 
number of mutants in X Orf were generated to investigate the 
importance of the central cavity in DNA binding. Several of these 
mutants exhibited significant defects in DNA binding consistent 
with ssDNA passing through the interior of the Orf ring. These 
results, in combination with the ability of Orf to recognize bubble 
structures, support a clamp model for Orf assembly. The 
conservation of ssDNA and SSB binding activities in diverse 
members of the Orf family underscores the importance of 
facilitated loading of a partner recombinase for the successful 
initiation of phage recombination. 

Results 

The phage Orf family 

A position-specific iterative BLAST search performed with the X 
Orf protein identified homologs exclusively from phage genomes, 
predominantly Myoviridae, Siphoviridae and Podoviridae, or 
associated with prophage-like elements (Pfam05772 [16]). The 
initial BLASTP search uncovered multiple closely related 
sequences with an expectation value of less than 5xl0 -5 . All of 
these proteins displayed greater than 30% identity with Orf. At the 
first PSI-BLAST iteration additional sequences with an expecta- 
tion value of less than 4xl0~ were identified, including a highly 
diverged Orf-like protein (YbcN) from the DLP12 cryptic 
prophage of E. coli belonging to the PRK09741 conserved 
domain [17], currendy classified as distinct from the PF05772 
NinB [X Orf) family. This gene matches almost exactiy (97% 
identity) the ybcN (orf 151) gene of phage 82 [18], yet shares only 
16% overall identity with Orf. Further searches initiated with 
DLP12 YbcN recovered typical Orf proteins including those from 
Aa^23, Stx2 and X at the second PSI-BLAST iteration confirming 
the relationship. YbcN showed significant similarity to several 
prophage-associated proteins including E. coli 0157:H7 (GP- 
933X; 93% identity), Shigella sonnei (81% identity), Photorhabdus 
luminescens (41% identity) and Neisseria gonorrhoeae (27% 
identity). The Phyre 2 protein fold recognition tool was used to 
compare the predicted secondary structures of DLP12, P. 
luminescens and N. gonorrhoeae YbcN proteins with known 
three-dimensional structures in the structural classification of 
proteins (SCOP) database. In all three cases, the X Orf fold was 



returned with >95% estimated precision. Representative Orf 
family proteins identified in this study varied in length from 1 24- 
181 residues; a list, including accession numbers, is available in 
Table SI. 

A phylogram was prepared using the PAUP program to 
visualize similarity between these selected Orf family representa- 
tives (Figure SI). In this comparison there appear to be at least 
four discrete Orf groups occurring in phages from diverse species, 
although most belong to the ft and y proteobacteria. As might be 
expected, examples from closely related species tend to cluster 
together. For example, one of the groups contains highly related 
Orf proteins displaying 91-99% identity and is principally derived 
from E. coli 0157:H7 or 084:H" resident prophages (Figure SI, 
shaded in blue). However, this is not always the case as other 
clusters incorporate representatives from E. coli H phage HK620, 
Actinobacillus actinomycetemcomitans phage Aa</>23 and Hae- 
mophilus influenzae prophages. Indeed some orthologs, mainly 
from prophages residing in dissimilar bacterial species, do not fit 
unambiguously into any of the above groups. These discrepancies 
illustrate the difficulty in unraveling phage and prophage lineages 
due to high rates of horizontal gene transfer and mutation. 

The Listeria proteins belong to the Orf family but are distandy 
related, sharing only 18% identity with X Orf (Figure SI and Table 
SI). Both L. innocua Orf-like proteins belong to yet another 
distinct conserved domain DUF968 (Pfam06147), which includes 
homologous polypeptides from Staphylococcus, Lactobacillus and 
Enterococcus phages and prophages. The N-terminal domain of 
these latter proteins share 23-24% identity with X Orf consistent 
with a distant relationship with the Orf family. However, members 
of the DUF968 family possess an additional C-terminal extension, 
absent from the majority of Orf homologs, which incorporates a 
zinc finger HNH endonuclease motif [19]. These phages may 
therefore encode a novel single-strand specific deoxyribonuclease 
arising from fusion of an Orf DNA binding domain to an HNH 
nuclease domain. It is interesting to note that the DUF968 domain 
is found fused to a number of recombinases, including the strand 
annealing proteins, Erf and Rad52, and the Holliday junction 
resolvase, RusA [17]. The HNH region alone from this family is 
also found joined to the mismatch recognition factor, MutS. 

Orf family gene organization among lambdoid phages 

The X orf gene lies in the ninR region between replication gene 
P and the Q_ antiterminator (Figure 1 A). In addition to Orf, this 
section encodes two other genes associated with DNA metabolism: 
Rap (NinG), a DNA branch-specific endonuclease [20] and NinH, 
a dsDNA binding protein (G.J. Sharpies, unpublished results). 
Where possible to determine based on homology of the 
surrounding genes, orf resides in a similar context in many phages 
and prophages, sandwiched between replication and lysis functions 
(Figure 1A). Significant shuffling of the genes in this region is 
evident with acquisition of unrelated genes, many of unknown 
function, and elimination of others. Despite these rearrangements, 
there are notable consistencies in gene order. For instance, 
preservation of synteny is particularly evident in the occurrence of 
the short ninD, ninE and ninF genes (Figure 1 A). In every case orf 
lies upstream of rap or rusA genes, which specify structurally 
distinct Holliday junction DNA resolvases [21]. The genetic 
organization of the orf-like gene in DLP12 (ybcN) located just three 
genes 5' of rusA adds support to the notion that it is a genuine 
ortholog of X Orf (Figure 1A). 

The gene arrangement is radically different among the DUF968 
(Pfam06147 [17]) Orf-like proteins from Listeria and Staphylococ- 
cus in keeping with phage sequences from different evolutionary 
origins. However, homologs of dnaB, dnaC or ssb are located close 
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Figure 1. Conservation of genomic organization and protein architecture among selected members of the Orf family. (A) Conserved 
gene order of orf relative to the replication and lysis genes of lambdoid phages and prophages. Genes are shown as boxes and colored according to 
their order in the X n'mR region; in almost all cases they are transcribed from left to right. Orthologous genes are highlighted by name and color. 
Those that failed to match any of the X genes in this region are shown as white boxes (the residue length of unannotated open-reading frames is 
indicated). Additional abbreviations refer to gene products that match the HNH nuclease domain family (HNH), DNA/RNA helicase superfamily II (SFII) 
or phage head proteins (head). The percentage identity to X Orf is shown for each Orf homolog and listed in order of similarity. Many of the 
prophages are reservoirs for insertion sequences (shaded in grey); note that additional genes encoded by these elements are not shown. The putative 
64-residue polypeptide between Q and S in X was also omitted. (B) Conserved primary structure of X Orf, DLP12 YbcN and ^ETA Orf20 proteins. X Orf 
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is aligned with homologous representatives from Y. pestis (Ype), E. coli 0157:H7 prophage Stx2-converting phage VT2-Sa and E. coli H phage HK620. 
DLP12 YbcN is aligned with a prophage homolog from P. luminescens (Plu). Staphylococcus aureus j^ETA Orf20 is aligned with a homolog from 
Enterococcus faecalis (Efa). Conserved residues within each subset are highlighted in blue (basic), red (acidic) and lilac (others); those occurring in both 
Orf/YbcN and Orf/Orf20 alignments are marked with an asterisk. Secondary elements from the crystal structure of X Orf [12] are indicated above the 
alignment, as are the site-directed mutants Q45A, K48A, W50A, R103E, V106E and W137A. A pair of cysteines from the HNH motif are highlighted in 
orange. 

doi:1 0.1 371 /journal.pone.01 02454.g001 



to the predicted orf gene in many of these sequences, linking these 
regions with functionality in DNA metabolism. 

Sequence similarity between X Orf and DLP12 YbcN 

Secondary structure predictions and modeling using Jpred and 
Phyre 2 with DLP12 YbcN (and representatives of the DUF968 
family) indicated a very similar overall architecture to that found 
with A Orf (Figure S2). The main differences appear to be in the 
vicinity of a3 and /?3 and in the C-terminal a-helix (Figure IB). 
Any alterations in this first region could potentially affect 
dimerisation, although there are unlikely to be gross distortions 
in the overall fold. By examining Orf family sequence alignments 
alongside the crystal structure of X Orf [12], functionally important 
residues in Orf and YbcN homologs were identified (Figure IB). 
Nine hydrophobic residues from two discrete segments of Orf, 
namely /?l-al-/?2 (comprising the RAGNYA motif [14]) and a3- 
/?3-/?4, contribute to the dimer interface. Four of these residues are 
conserved in YbcN homologs, while the remainder could be 
replaced by adjacent functionally equivalent amino acids (Fig- 
ure IB). Interestingly, the most highly conserved residues cluster 
around a-helix 2 and flank the positively charged channel that 
traverses the dimer [12]. Equivalents of X Orf Arg41, Gln45, 
Asn46 and Arg48 are present in the majority of Orf sequences 
(Figure IB, Figure S2 and data not shown). Arg41 from subunit B 
lies on the rim of the dimer cavity and could potentially contact 
the ssDNA backbone. In contrast, Arg41 in subunit A is hidden at 
the base of the cavity. Gln45 and Asn46 residues are symmetrically 
orientated and project from opposite sides towards the central 
channel. Asn46, in combination with Trp50 (which occurs as Trp 
or His in almost all Orf sequences), could interact with nucleotide 
bases if ssDNA can be induced to enter the ring interior. Finally, 
Arg48 is located at the rim of the aperture in each subunit, on 
opposite faces of the dimer, and could also make contacts with the 
C-terminal a-helix. 

Purification of DLP12 MBP-YbcN 

To resolve whether members of the PRK09741 (YbcN) group 
are genuine functional equivalents of X Orf, we set out to purify 
YbcN from E. coli DLP12 and validate its properties in vitro. 
Plasmids carrying the ybcN gene were constructed in order to 
overexpress wild-type and an N-His 6 fusion of YbcN protein. High 
levels of expression were achieved, however, all of the YbcN 
recovered precipitated with the cellular debris upon lysis (data not 
shown). In an attempt to recover soluble protein we designed a 
construct that fuses the E. coli maltose binding protein (MBP) to 
the N-terminus of YbcN. Large quantities of soluble YbcN were 
recovered and the protein purified to near homogeneity by 
amylose and heparin chromatography. A GST- YbcN fusion was 
also purified at the same time using glutathione sepharose, heparin 
agarose and Q_ sepharose chromatography steps. MBP-YbcN and 
GST- YbcN proteins were analyzed in parallel with purified MBP- 
Orf and GST-Orf fusions from bacteriophage X [13]. 

YbcN binds preferentially to ssDNA 

X Orf protein favors binding to ssDNA over dsDNA in keeping 
with its predicted role in the initial steps of recombinational 



exchange [12]. To confirm that YbcN is functionally analogous to 
Orf, we investigated the ability of MBP-YbcN to bind short 
ssDNA (SS 5 o) and dsDNA (DS 30 ) substrates in an electrophoretic 
mobility shift assay carried out in the presence of EDTA to 
exclude metal ions. Similar to MBP-Orf (Figure 2A, lanes b-e), 
MBP-YbcN formed a single complex with the 50 nucleotide (nt) 
ssDNA substrate (Figure 2A, lanes l-o). In both cases smearing of 
the substrate is indicative of unstable protein-ssDNA complexes 
that tend to dissociate during electrophoresis. MBP-YbcN 
exhibited a lower affinity for ssDNA relative to MBP-Orf protein 
(Figure 2B). Replacing EDTA with 1 mil MgCl 2 , in both binding 
reaction and gel, significantly reduced binding of MBP-YbcN and 
MBP-Orf to SS 30 (Figure 2B). MBP-Orf was much more sensitive 
than MBP-YbcN to inclusion of Mg 2+ ions. No further reduction 
in ssDNA binding was observed when 10 mM MgCl 2 was used 
(data not shown). The results are consistent with a counterion 
effect and protein-DNA interactions occurring through electro- 
static contacts with the DNA backbone. Neither MBP-YbcN nor 
MBP-Orf bound well to 50 bp dsDNA, although both proteins did 
manage to form a stable complex with a small proportion of the 
substrate with smearing again consistent with a weak association 
(Figure 2A, lanes g-j and q-t). 

To provide more quantitative data on the relative affinities of 
YbcN and Orf for ssDNA, we employed fluorescence anisotropy 
using a fluorescein-labeled SS 60 substrate. This method relies on 
the fact that protein assembly onto DNA slows the spin of the 
fluorescently-labeled molecule in polarized light resulting in an 
increase in observed anisotropy (r ()bs ). Studies are typically 
conducted with DNA substrates of <40 bp in length but larger 
substrates of 70 bp also yield reliable data [22]. As in the gel shift 
assays, MBP-YbcN bound much less well to ssDNA than MBP- 
Orf, with a 21 -fold difference in their estimated dissociation 
constants (Table 1). 

Orf and YbcN binding to mismatch, bubble and bent 
DNA 

Since YbcN has been implicated in binding preferentially to 
DNA containing a T:G mismatch or abasic site [15], we compared 
the binding properties of MBP-YbcN and MBP-Orf on 60 bp 
DNA substrates with 1,5, 13 and 20 nt unpaired regions, utilizing 
60 nt ssDNA as a control. The larger bubble structures resemble 
structures more likely to be encountered during recombination 
rather than DNA mismatch recognition and repair. MBP-Orf 
formed two complexes with SS 60 (Figure 2C, lane b), suggesting 
that the additional 10 nt permits assembly of two dimers of Orf 
compared to the 50 nt substrate (Figure 2A, lanes b-e). As noted 
before (Figure 2A), binding to dsDNA was less efficient and the 
complexes formed were prone to dissociation (Figure 2C, lane e). 
Inclusion of mismatches in the DNA affected the pattern of 
complexes formed, with BB 5 , BB l:i and BB 2 o showing improved 
binding and formation of a more stable, distinct complex 
(Figure 2C, lanes k, n and q); the complex formed with these 
substrates is actually composed of two closely-migrating species, 
possibly indicating two alternative conformations of Orf bound to 
bubble DNA. Binding of MBP-Orf to the single G:G mismatch 



PLOS ONE I www.plosone.org 



4 



August 2014 | Volume 9 | Issue 8 | e102454 



Phage X Orf Family Recombinases 



MBP-Orf MBP-YbcN 




abcdefghij klmnopqrst 




SS 60 DS 60 MM G:G BB 5 BB 13 BB 



-OY-OY-OY-OY-OY-OY 




abcdefghij klmnopqr 



Figure 2. Comparison of MBP-YbcN and MBP-Orf binding to 
DNA. (A) Binding to ssDNA and dsDNA. Gel mobility shift assays 
contained 0, 62.5, 125, 250 and 500 nM MBP-Orf or MBP-YbcN proteins, 
5 mM EDTA and either 0.3 nM of 32 P-labelled 50 nt (SS 50 ) ssDNA (lanes 
a-e and k-o) or 50 bp dsDNA (lanes f-j and p-t). (B) MBP-YbcN and MBP- 
Orf binding to ssDNA. Binding reactions contained 0.3 nM SS 50 with 0, 
7.81, 15.625, 31.25, 62.5, 125, 250 and 500 nM protein and either 5 mM 
EDTA or 1 mM MgCI 2 . Data are the mean and standard deviation of two 
independent experiments. (C) Binding to mismatch and bubble DNA. 
Gel mobility shift assays contained 250 nM MBP-Orf (O) or MBP-YbcN 
(Y) proteins, 5 mM EDTA and 0.15 nM of 32 P-labelled 60 nt (SS 60 ) ssDNA 
(lanes a-c), 60 bp (DS 60 ) dsDNA (lanes d-f), 1 bp (MM G:G ) mismatch 
(lanes g-i), 5 bp (BB 5 ) bubble (lanes j-l), 13 bp (BB 13 ) bubble (lanes m-o) 
and 20 bp (BB 20 ) bubble (lanes p-r). 
doi:1 0.1 371 /journal.pone.01 02454.g002 

(Figure 2C, lane h) did not differ significandy from that seen with 
the fully paired substrate (Figure 2C, lane e). 

MBP-YbcN formed two complexes on SS fi() (Figure 2C, lane c) 
and three with DS 60 , although with a slighdy altered mobility 
(Figure 2C, lane fj. Surprisingly, as the length of unpaired DNA 
increased from 1-20 nt, there was no marked improvement in 
DNA binding or stability of complex formation (Figure 2C, lanes i, 
1, o and r). A 2.8-fold preference for BB 2 o over SS 60 was apparent 
with MBP-YbcN in fluorescence anisotropy assays (Figure S3; 



Table 1). It is possible that the flanking duplexes in the bubble 
substrate help stabilize Orf, but not YbcN, complex formation by 
preventing protein dimers from sliding off the ends. 

The absence of any preference for MM G:G (a G:G mispair) 
relative to DS 60 (compare Figure 2C, lanes f and i) could be 
because YbcN has a high degree of specificity for single T:G base 
pair mismatches [15]. To assess this possibility, an MM T:G 
substrate with a T:G rather than a G:G mismatch was employed 
in fluorescence anisotropy assays. MBP-YbcN exhibited no 
significant differences in binding to MMt : g or a fully duplex 
control (Figure S4A). MBP-Orf showed some preference for the 
DNA containing a T:G mispair suggesting some specificity for this 
type of replication error, although the data sets varied, consistent 
with a relatively poor association with duplex DNA (Figure S4A). 
Binding of MBP-YbcN and MBP-Orf was also tested on substrates 
containing centrally located insertions of 1 , 2 or 3 adenines in one 
of the strands to generate bent DNA (Figure S4B). Adenine bulges 
of 1-3 nt are known to introduce a kink of 50-70° in duplex DNA 
[23,24]. These substrates resemble mismatches by reducing the 
thermodynamic stability of the helix and inducing localized 
distortion of the DNA backbone. Neither protein showed any 
preference in binding these three substrates relative to DS 60 
(Figure S4B). Hence under the conditions employed here, neither 
YbcN nor Orf appear to favor bent DNA or duplexes where 
mispaired or bulged nucleotides might encourage the formation of 
extrahelical bases. 

YbcN binds SSB 

Previous studies [12,13] revealed an interaction between X Orf 
and E. coli SSB consistent with a role at an early stage in phage 
recombination. Far western blotting experiments were conducted 
as in these earlier studies to determine whether an association with 
SSB is conserved in YbcN. As with X MBP-Orf, an interaction 
between MBP-YbcN and SSB bound to the membrane was 
detected with anti-MBP antibodies (Figure 3A, lanes a-c), 
suggesting that contacts between these proteins are preserved 
and are therefore functionally relevant. In fact, YbcN yielded a 
stronger signal than Orf [1 3] indicating that it has a higher affinity 
for SSB. No bands were detected using MBP alone to probe 
membrane-bound SSB protein showing that YbcN is necessary to 
detect binding to SSB. The association with SSB was further 
validated using a purified GST-YbcN fusion in far western blots 
probed with antibodies specific for the GST tag. The results 
mirrored those obtained with the MBP-tagged YbcN protein 
(Figure 3A, lanes d-f). 

A GST-YbcN interaction with SSB, similar to a positive control 
containing GST captured on the plate, was also detected in an 
enzyme-linked immunosorbent assay (Figure 3B). GST-Orf failed 
to produce a signal suggesting that some of the SSB epitopes 
normally recognized by Orf may be occluded by the manner of 
SSB attachment to the surface of the plate. Addition of a 51-nt 
ssDNA at the outset or upon addition of GST-Orf or GST-YbcN 
did not significantly alter the results obtained (data not shown). In 
addition, YbcN-SSB complex formation was probed using the 
yeast two-hybrid system. A clear interaction was detected between 
YbcN and SSB (Figure 3E). X Orf and E. coli SSB also associated 
in equivalent assays but much less strongly [13]. 

The C-terminus of SSB is required for YbcN binding 

Ten residues at the carboxy-terminus of E. coli SSB (168- 
PMDFDDDIPF-177) comprise a negatively-charged tail that 
functions as an assembly point for multiple proteins involved in 
DNA replication, recombination and repair [25]. X Orf has 
previously been shown to bind SSB in far western assays in the 
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Table 1. MBP-YbcN and MBP-Orf mutant dissociation constants. 



protein 


ss 60 


BB 2 o 


YbcN 


4602±275.1 


1619±3.2 


Orf wt 


223±72.6 


173±20.9 


Q45A 


272±20.4 


264±15.2 


K48A 


622 ±0.7 


551 ±6.4 


W50A 


3535±287.8 


1606±418.2 


R103E 






V106E 


522±12.4 


540 ±73.2 


W137A 


628 ±99.9 


285 ±1.9 



Fluorescence anisotropy assays were performed in 1 00 mM Tris.HCI pH 7.0 with 1 0 nM fluorescein-labeled DNA. The apparent K D (nM) was calculated for the protein as 
a dimer. Standard deviations of the duplicate data sets are shown. 
doi:1 0.1 371 /journal.pone.01 02454.t001 



presence or absence of this C-terminal region [13]. Similar 
experiments were carried out with MBP-YbcN using SSB 113, 
which carries a single amino acid substitution in this region 
(P176S) and is known to confer a temperature-sensitive phenotype 
[26], and SSBAC10, which lacks the last 10 residues entirely. 
Although SSB113 and SSBAG10 proteins transferred less well to 
the membrane than wt SSB (Figure 3C, row ii, lanes a-i), it does 
appear that the MBP-YbcN interaction with the SSB C-terminal 
mutants is reduced relative to the wt, especially with SSB 1 1 3 



(Figure 3, row i, lanes a-i). Residual binding could potentially be 
due to traces of wt SSB in the SSB1 13 and SSBAC10 samples as 
both mutant proteins were, by necessity, purified from strains 
carrying an intact ssb gene. 

The possible reduced association between YbcN and the two 
mutant SSB proteins was investigated further using the enzyme- 
linked immunosorbent assay with SSB bound to the plate and anti- 
GST antibody used to detect any interaction with GST- YbcN 
(Figure 3D). In this assay, no binding between YbcN and 
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Figure 3. YbcN binding to SSB. (A) Interaction of YbcN with SSB in far western assays. SSB protein (0.5, 1.9, 3.8 ug) separated on 15% SDS-PAGE 
was blotted and probed with 20 ug MBP-YbcN (lanes a-c) or GST-YbcN (lanes d-f). Interactions were detected with antibodies specific for either the 
MBP or GST domains. (B) Interaction of GST-YbcN with SSB as detected by ELISA. SSB (0.5 |ig) bound to lmmulon-1 microtitre plates was exposed 
either to GST-YbcN, GST-Orf or GST proteins (0.0195-10 ug). Positive interactions were detected by addition of anti-GST antibody-HRP conjugate. 
Data are the mean and standard deviation of four independent experiments. (C) YbcN binding to SSBAC1 0 and SSB1 1 3 proteins in far western assays. 
0.5, 1.9, 3.8 ug SSB (lanes a-c), SSBAC10 (lanes d-f) and SSB113 (lanes g-i) separated on 15% SDS-PAGE were blotted and probed with 20 |ig MBP- 
YbcN. Interactions were detected with antibodies specific for MBP (row i). Row ii shows the efficiency of SSB, SSBAC10 and SSB113 transfer onto 
blotted membranes as revealed by staining with amido black. Panel i (lanes a-c) is reproduced from (A) to facilitate comparisons with MBP-YbcN 
binding to wild-type SSB. (D) Interaction of GST-YbcN with SSB, SSBAC10, SSB113 and SSBA1 15-144 as detected by ELISA. SSB wild-type and mutant 
proteins (0.5 |ig) bound to microtitre plates were exposed to GST-YbcN (0.078-2.5 |tg) and probed as in (B). Data are the mean and standard deviation 
of two independent experiments. (E) Yeast two-hybrid analysis of YbcN interactions with SSB and SSBAC10. Experiments were performed using 
constructs fused to the GAL4 DNA-binding or activating domains. The p-galactosidase activities were the mean and standard deviation of three 
independent experiments. The vertical bars represent the p-galactosidase activity (Miller units) and values for YbcN-SSBAO 0 are given for pGBKT7-53 
and pGADT7-T constructs in both orientations. 
doi:1 0.1 371 /journal.pone.01 02454.g003 
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SSBAC10 or SSB113 was detected, whereas an interaction was 
readily observed with wt SSB. Another SSB deletion mutant, 
Al 15-144 [27], behaved like wt SSB, suggesting that this portion 
of SSB is dispensable for YbcN-SSB binding (Figure 3D). Anti- 
SSB antibodies were used to confirm that wt SSB, SSB 113, 
SSBAC10 and SSBAl 15-144 proteins all bound similarly well to 
the microtiter plate (data not shown). In addition, yeast two-hybrid 
analysis with the SSBAC10 mutant showed a significant reduction 
in binding to YbcN (Figure 3E). Taken together, the results are 
consistent with the last 10 residues of SSB being necessary for a 
stable YbcN interaction. This contrasts with experiments showing 
that X Orf retains the capacity to associate with SSB AC 10 and 
SSB113 mutants [13]. 

Secondary and quaternary structure of MBP-YbcN 

CD spectroscopy was used to provide information on the 
secondary structure and folding of MBP-YbcN and GST- YbcN 
proteins. MBP-YbcN yielded a typical spectrum for a globular 
protein and appears to be folded correcdy (Figure S5A). The 
predicted proportion of helices, sheet, turn and coil are similar to 
those observed with MBP-Orf, in keeping with a similar overall 
fold (Table S2). In contrast, GST- YbcN appears rather more 
disordered (Figure S5A) and secondary structure predictions do 
not differ significantiy from GST protein alone (Table S2). 
However, the protein does appear to be functional in binding to 
SSB (Figure 3) and ssDNA (Figure S5A, lanes j-1). 

Size exclusion chromatography was employed to investigate the 
oligomeric state of MBP-YbcN and GST- YbcN proteins. MBP- 
YbcN eluted with a relative molecular mass of 113 kDa in 
agreement with the 120 kDa predicted for a homodimer (Figure 
S5B). The dimeric species is relatively stable and can be detected 
as an ~ 1 70 kDa species in SDS-polyacrylamide gels (Figure S5B). 
Hence both YbcN and X Orf [13] exist as a homodimer in 
solution. Gel filtration analysis of GST- YbcN, however, identified 
three major peaks that could correspond to tetramer, octamer and 
a larger multimeric species (Figure S5B). Stable tetrameric species 
of 140-150 kDa (a monomer of GST- YbcN is expected to yield a 
44.5 kDa protein) were observed in SDS polyacrylamide gels 
(Figure SSB). Since GST is known to dimerize [28], higher 
molecular mass complexes consisting of dimer-dimer combinations 
are not unexpected. 

MBP-Orf substitution mutants flanking the central cavity 

A number of site-directed mutants were constructed in X Orf to 
help identify residues critical for interaction with ssDNA 
(Figure 4). Alanine substitution mutants were introduced in an 
aromatic residue projecting into the central cavity (W50A), as well 
as two nearby polar residues at the channel entrance (Q45A, 
K48A). These residues match several that are conserved within the 
Orf family (Figure IB and Figure S2). More radical glutamic acid 
substitutions (R103E, V106E) were generated further from the 
cavity entrance that could also potentially participate in stabilizing 
contacts with DNA (Figure 4). The latter two mutations were also 
chosen as they lie in a motif (102-SRMRVGE- 108) that resembles 
a region at the subunit-subunit interface in the E. coli RecA 
filament (25-SIMRLGE-31) [29,30] and could potentially mimic a 
RecA monomer to nucleate polymerization as a recombination 
mediator. A related motif has been identified in BRCA2 that 
promotes assembly of Rad51, the human RecA ortholog, at 
ssDNA gaps [29,31]. An additional alanine substitution was 
introduced in the C-terminal helix (W137A) as it was anticipated 
that it would behave similarly to mutants obtained previously in 
this region [13], which are known to impair DNA binding to linear 
ssDNA and dsDNA. All mutant proteins were purified as fusions 
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Figure 4. Size-exclusion chromatography of MBP-Orf mutant 
proteins. Proteins (1 mg/ml) were applied to a 24 ml Superose 12 HR 
10/30 column. In each panel, the crystal structure of Orf protein shows 
the location of the relevant substitution mutant. Chain A is colored cyan 
and Chain B, lilac; the projecting C-terminal helix is absent from the 
latter. Predicted molecular weights for MBP-Orf are 59.6 kDa (monomer) 
and 119.2 kDa (dimer). Oligomeric states are depicted with a circle 
representing a single subunit and placed adjacent to the corresponding 
peak. Boiled (+) and unboiled (-) samples of each purified protein were 
separated on 12.5% SDS-PAGE and stained with Coomassie blue. 
doi:10.1371/journal.pone.0102454.g004 

with maltose binding protein to improve Orf solubility, simplify 
the recovery of milligram quantities of pure protein and facilitate 
far-western assays for protein-protein interactions. 

Quaternary and secondary structure of MBP-Orf mutants 
and effect on SSB binding 

To confirm that the six mutations did not substantially affect 
dimerization or folding, size exclusion chromatography, static light 
scattering experiments and circular dichroism (CD) spectroscopy 
were performed. The quaternary structure of each mutant was 
assessed by size exclusion chromatography. The results reveal that 
the mutant proteins tend to elute as a single major peak of 123- 
127 kDa (Figure 4) in keeping with the 120 kDa predicted for an 
MBP-Orf homodimer. Dimeric species of 135-140 kDa resistant 
to denaturation were also observed on SDS-PAGE when purified 
samples were separated in the absence of a reducing agent and 
without boiling (Figure 4). The Q45A mutant showed an 
additional peak in gel filtration that corresponded to the size of 
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a monomer (Figure 4A) that may indicate that this residue 
normally contributes to dimerization, although this may not be a 
direct effect since it is located far from the subunit interface. Gel 
filtration analysis on V106E revealed two major peaks with the 
larger species consistent with formation of a pair of Orf dimers 
(Figure 4E). However, when samples were reexamined at pH 7, 
instead of pH 8, a single dimeric peak at 125 kDa was observed at 
both 0.5 mg/ml or 1 mg/ml protein (data not shown). Thus 
electrostatic interactions may be important in the formation of 
multimeric complexes in this mutant. 

Each of the mutants was also examined by static light scattering, 
which determines the particle size in solution and provides an 
estimate of the oligomeric status of a protein sample. MBP-Orf 
yielded a 6.1 nm particle radius and similar sizes of 5.2-5.9 nm 
were obtained with Q45A, K48A, W50A, R103E and W137A 
(Figure 5A and Table S3). These correspond to an average 
molecular mass of 188 kDa, about 1.6 times that of an MBP-Orf 
dimer, although estimates are based on the assumption that a 
particle is spherical. The smallest dimension of 5.2 nm was 
obtained with R103E potentially indicating some conformational 
change affecting particle shape (Table S3). The V106E mutant 
differed from the other mutants (Figure 5A) in producing a large 
particle size with a radius of 1 14.9 nm (Table S3). It also showed a 
high percentage polydispersity of 55.8% suggestive of protein 
aggregation. Under these conditions, MBP also produced an 
apparent multimer of 17.9 nm (Table S3), which is in accord with 
previous gel filtration data indicating it exists as a large protein 
complex [13]. 

The mutant MBP-Orf proteins were also studied by CD 
spectroscopy to assess secondary structure and folding properties. 
All six mutants appear to be folded correctly (Figure 5B) and 
secondary structure predictions suggest that the R103E, V106E 
and W137A do not differ significandy from wt MBP-Orf (Table 
S2). The three mutants clustered around the central channel on a.2 
of the Orf dimer, Q45A, K48A and W50A, do however show 
differences in the proportion of predicted a-helix, yS-sheet and turn 
(Table S2). K48A showed the most dissimilar CD spectrum in the 
far-UV region (Figure 5B) with a significandy increased percent- 
age of ot-helix and a corresponding decrease in both /?-sheet and 
turn (Table S2). Hence, although these three proteins are folded 
and dimerize normally, the alanine substitutions may affect the 
local conformation of a.2 and adjacent secondary structures. 

Taken together the data do not reveal any substantial changes 
in mutant protein folding or homodimer formation, although 
subtle changes in secondary structure were observed with Q45A, 
K48A and W50A. V106E has a propensity to form multimers as 
noted in gel filtration and static light scattering experiments. In the 
case of the latter mutant it is possible that the presence of an 
additional negative charge on the surface encourages protein- 
protein interactions between Orf dimers that are generally 
positively charged over their protein surface. 

Far-western blotting experiments were performed to determine 
whether any of the Orf mutants affected the association with E. 
coli SSB. None of the mutants showed any reduction in interaction 
with SSB (Figure S6), suggesting that contacts between the two 
proteins must be located elsewhere. 

Effect of MBP-Orf mutations on DNA binding 

The impact of the six MBP-Orf substitution mutants (Fig- 
ure 6A) on ssDNA and bubble DNA binding was examined in 
electrophoretic mobility shift and fluorescence anisotropy assays. 
Binding to a 50 nt radiolabeled linear ssDNA was examined first 
(Figure 6B) in the absence of metal ions. MBP-Orf wt forms a 
single major complex with the ssDNA (Figure 6B, lanes a-b), with 



30 



0 





MBP 




MBP-Orf 




W50A 




Q45A 




K48A 




1 A V106E 


1 1 ' 


| / \ W137A 







1 10 100 1000 

radius (nm) 




-6 J 1 1 . . 1 1 1 1 

180 190 200 210 220 230 240 250 260 

X./ nm 



Figure 5. CD spectra and SLS analysis of MBP-Orf mutant 
proteins. (A) SLS analysis of selected MBP-Orf mutant proteins. 
Proteins at 0.2 mg/ml in 100 mM Tris-HCI pH7 at 25 C were analyzed in 
a Zetasizer |j.V. (B) CD analysis of MBP-Orf mutant proteins. CD spectra 
(180-260 nm) were obtained for MBP-Orf proteins in ultrapure water at 
20°C and analysis performed using CDSSTR, Contin/LL and Selcon3 
programs from the CDPro suite [54]. 
doi:1 0.1 371/journal.pone.01 02454.g005 

smearing consistent with an unstable interaction with the substrate 
as noted previously [13]. The Q45A mutant showed a minor 
defect in binding as evidenced by the increased presence of 
unbound substrate (Figure 6B, lane d) relative to the wt. W137A 
also showed a slight reduction in ssDNA binding, similar to that 
seen with a W141F mutant [13] and consistent with the tz5 helix 
contributing to DNA binding stability. More severe defects in 
ssDNA binding were evident with K48A and V106E, while W50A 
bound only weakly (Figure 6B). No DNA binding was detected 
with the R103E mutant (Figure 6B, lanes j-k). 

Gel shift experiments were also performed on dsDNA (DS 60 ) 
and 20 nt bubble (BB 2 o) substrates (Figure 6C). The results 
generally mirrored those with SS 50 (Figure 6B), although they 
highlighted more severe DNA binding defects with several of the 
mutants. While wt MBP-Orf binds relatively poorly to DS 60 
(Figure 6C, lane b), K48A, W50A, R103E and V106E displayed 
litde or no association with this substrate (Figure 6C, lanes d-g). 
Similarly poor binding was observed with the bubble DNA, 
although K48A did bind a small proportion of BB 2 o but only as a 
much faster-migrating complex (Figure 6C, lane 1). Q45A and 
W137A showed less extreme defects on DS 60 (Figure 6C, lanes c 
and h). However, the stability of complexes detected on BB ao was 
reduced slightly with W137A (Figure 6C, lane p) and significandy 
with Q45A (Figure 6C, lane k) compared to the wt. Interestingly, 
Q45A showed an inability to form the two closely-migrating 
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Figure 6. DNA binding by MBP-Orf mutant proteins. (A) Crystal structure of Orf protein (1PC6) highlighting the location of site-directed 
mutations. Chain A is colored cyan and Chain B, lilac. The projecting C-terminal helix is absent from the latter. (B) MBP-Orf mutant binding to ssDNA. 
Gel mobility shift assay contained 0.3 nM 32 P-labelled ssDNA (SS 50 ) and protein at 150 and 500 nM as indicated. Samples were separated on 4% 
polyacrylamide gel electrophoresis and visualized by autoradiography. (C) MBP-Orf mutant binding to double-stranded and bubble DNA. The gel 
shift assay contained 0.15 nM 32 P-labelled dsDNA (DS 60 ; lanes a-h) or bubble DNA (BB 20 ; lanes l-p) and 250 nM MBP-Orf protein. Samples were 
separated by 4% polyacrylamide gel electrophoresis and visualized by autoradiography. (D and E) MBP-Orf mutant binding to 10 nM fluorescein- 
labeled ssDNA (D) and BB 2 o DNA (E) as determined by fluorescence anisotropy. Binding isotherms are colored as in (A). Data are the mean and 
standard deviation of two independent experiments. 
doi:1 0.1 371 /journal. pone.01 02454.g006 



complexes seen with the wt and W137A on BB 2 o (Figure 6C, 
compare lanes j, k and p). 

Fluorescence anisotropy was again used to give quantitative 
information on binding to SS 60 and BB 20 substrates. Binding 
isotherms are presented (Figure 6D-E) alongside dissociation 
constants derived from the data (Table 1). The results match well 
those obtained in gel shift assays (Figure 6C) with gradually 
decreasing DNA binding activity in the order wt, Q45A, W137A, 
K48A, V106E, W50A and R103E. However, differences were 
noted within and between the two substrates that were particularly 
evident in the estimated Krj for each mutant protein (Table 1). In 
all cases, Q45A showed only a slight defect in DNA binding 
relative to wt. Similarly, R103E was the most severely deficient in 
DNA binding, followed by W50A. The remaining three mutants 
varied in order depending on the substrate. V106E bound better 
than K48A and W137A to SS 60 , whereas W137A bound more 
tightly than V106E and K48A to BB 20 (Table 1). W50A also 
showed a preference for binding to BB 20 over SS 60 . The observed 
anisotropy gradually decreased in the V106E mutant at concen- 
trations above 4 |iM with both SS 60 and BB 20 (Figure 6D-E). This 
could be due to a contaminant nuclease destroying the substrate, 
but it is more likely that V106E aggregates as the amount of 
protein increases resulting in a reduction in protein available for 
DNA binding. This conclusion fits with the data in Figures 4E and 



5 A showing that V106E has a tendency to form higher molecular 
mass complexes. 

Discussion 

Phage X Orf substitutes for the E. coli RecFOR proteins in 
genetic exchange reactions, possibly by directing assembly of 
either bacterial RecA or phage /? recombinases onto ssDNA 
coated with SSB [8,11]. Consistent with this role, purified Orf 
interacts with both E. coli SSB and ssDNA substrates [12,13]. In 
this study we identified a family of Orf recombinases from diverse 
lambdoid phages sharing similarity in primary sequence, predicted 
secondary structure and genomic context within a conserved 
module located between replication and lysis genes. The 
authenticity of the proposed family relationship was investigated 
by studying the biochemical properties of YbcN, one of the most 
distantiy-related members of the Orf family. YbcN is encoded by 
the E. coli cryptic lambdoid prophage DLP12, is 97% identical to 
a protein from phage 82 [18] and has previously been shown to 
bind preferentially to abasic sites and T:G mismatches, with the 
capacity to flip mispaired bases out of the double helix [15]. 
However, an involvement of YbcN in mismatch recognition does 
not seemingly fit with an activity responsible for suppressing the 
recombination and UV light repair deficiencies of recFOR 
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mutants. The specific DNA binding properties of YbcN were 
therefore examined in direct comparison with X Orf. 

YbcN shared with Orf a preference for binding to short 50 or 60 
nt ssDNA substrates rather than duplexes of the same length and 
sequence. However, YbcN bound much less well to ssDNA than 
Orf in the absence of metal ions, whereas their binding profiles 
were more similar in the presence of Mg 2+ ions (conditions more 
closely resembling those found in the cell). No difference in 
binding was detected between a fully complementary dsDNA and 
one containing a single T:G or G:G mismatch with YbcN protein, 
although Orf did show a slight preference for the T:G mispair. 
YbcN showed no improved binding to related substrates 
containing a bubble of 5, 13 or 20 nt in gel shift assays, although 
a slight preference for 20 nt bubble DNA was evident in 
fluorescence anisotropy assays. In contrast, Orf bound much 
more tightly to these bubble DNA structures in keeping with its 
ability to load on ssDNA. It is possible that YbcN has a larger 
footprint than Orf and therefore requires a longer stretch of 
ssDNA to assemble in a stable complex. Alternatively, it may be 
that a free end is necessary for loading of YbcN, and that the 
flanking duplexes in these bubble substrates prevented assembly. 
The DNA binding properties of YbcN, and the observation that it 
forms homodimers, are in accord with a recombinase that is 
homologous to X Orf and are consistent with a role at the early 
stages of recombination, binding to ssDNA exposed by phage 
exonuclease or targeting replication forks or bubbles in the 
template. 

A high-throughput screen of E. coli protein-protein interactions 
had previously indicated an association between YbcN and SSB 
[32]. Far western blotting, ELISA and yeast two-hybrid experi- 
ments confirmed this interaction, with YbcN showing a higher 
binding affinity for its bacterial partner than Orf. These results 
confirm that a distandy-related Orf family member has the 
capacity to bind bacterial SSB and supports the notion that they 
form a conserved, functional complex. Multiple members of the 
replication, recombination and repair apparatus have been shown 
to target the negatively-charged C-terminus of SSB [25], 
presumably to promote delivery at sites requiring their action. 
Previous studies had revealed that the terminal 10 residues of SSB 
were dispensable for X Orf binding [13]. In contrast, YbcN showed 
significandy reduced binding to SSB mutant proteins lacking this 
region (AC 10) or carrying an SSB1 13 mutation, known to disrupt 
interactions with partner proteins [33,34], Some residual binding 
of YbcN to SSB was detected by far western blotting and yeast 
two-hybrid assays, suggesting there may be other portions of the 
flexible C-terminus of SSB (residues 1 13-177) or elsewhere that are 
more important for the Orf-SSB association. It may be that the 
region on SSB targeted by Orf contributes to remodeling of the 
SSB-ssDNA complex that allows loading of recombinase onto the 
exposed ssDNA. It is noteworthy that while both YbcN and Orf 
interact specifically with E. coli SSB, they appear to do so by 
recognizing disparate features on their bacterial partner. 

Although there are differences in YbcN and Orf binding to 
DNA, their preference for single-stranded DNA does support the 
prediction that they belong to a conserved family with a 
comparable role in genetic recombination. It is difficult to 
reconcile any specificity for abasic sites and mismatches [15] with 
an activity that suppresses defects associated with mutation of the 
recF, recO or recR genes, unless the Orf domain has been co-opted 
to generate a base-flipping enzyme in YbcN. Neither Orf nor 
YbcN share any homology with enzymes that might incise DNA at 
a mismatch (e.g. a DNA glycosylase or apurinic/apyrimidinic 
endonuclease) or to modify any flipped out bases (e.g. a 
methyltransferase). Moreover, under the assay conditions used 



here, no specificity for G:G or T:G mismatches was detected with 
YbcN protein. One possibility is that Orf and YbcN can in certain 
circumstances capture and stabilize exposed DNA bases as part of 
their normal function in recognizing ssDNA for assembly of 
partner recombinases. Such an activity may ensure that a sufficient 
amount of ssDNA is released from SSB-ssDNA complexes to 
permit nucleation of a partner recombinase. RecOR are known to 
accelerate nucleation of RecA on SSB-coated ssDNA, apparently 
by trapping transient states where an SSB tetramer slides or 
unwraps from the DNA strand [35]. Recognition of SSB by YbcN 
and Orf, as with RecO [36,37], would allow targeting of 
appropriate ssDNA substrates even if the protein-protein interac- 
tions do not directly contribute to SSB dissociation [36] . 

The prophage encoded YbcN could contribute directly to 
homologous recombination reactions in E. coli, although exper- 
iments with rusA [18], which lies downstream oiybcN in DLP12, 
suggest that insertion mutations that supply an active promoter 
upstream of this operon are needed for adequate gene expression 
at this locus. However, the presence of homology with other 
lambdoid phages in this region does mean that ybcN can serve as a 
repository of alternative phage genetic material that can be picked 
up by incoming phage, thus contributing to the ongoing evolution 
of mosaic genomes [38]. The presence of homologs of the 
defective prophage DLP12 ybcN gene in functional phage 
genomes, including 82 and ^80, suggests that it does specify an 
active Orf-like protein. 

The crystal structure of Orf [12] suggested two possible ssDNA 
binding modes, either negotiating the narrow (8 A) central channel 
or bound within a shallow groove running perpendicular to this 
cavity. Binding to a gapped duplex substrate, and the bubble DNA 
substrates used in this study, implied that the latter interaction was 
the genuine one [13]. However, the experiments did not exclude 
the possibility that the dimeric ring opens to permit ssDNA access. 
Electrostatic potential calculations performed on the Orf crystal 
structure support this possibility as the interior of the toroid is 
highly positively charged [12]. Furthermore, several of the most 
highly conserved residues in the Orf family line the walls of this 
cavity. The identification of a new DNA binding fold in the N- 
terminal portion of the Orf structure also fits with this idea. This 
domain, termed RAGNYA [14], comprises two a-helices (od from 
each monomer in Orf) and four /J-strands (fil and p2 from each 
Orf subunit) with the predicted DNA binding surface across the 
face of the sheet region and within the central cavity. It may be 
that additional contacts involve the C-terminal helix, which 
protrudes from either side of the dimeric ring (Figure 4) and has 
previously been shown to participate in DNA binding [13]. 

Mutants in Orf at several conserved sites within or at the 
entrance to the central channel, alongside a substitution in the C- 
terminal helix, were studied to help clarify their contribution to 
ssDNA recognition. The W50A mutant protein exhibited a 
significant defect in DNA binding providing the first clear 
evidence that the central cavity is important for stabilizing 
contacts with ssDNA. Trp50 projects from both sides into the 
space within the dimer interior and could stabilize binding to 
nucleotide bases as they enter the channel. Lys48 is located nearby 
on the cavity rim and a K48A mutation also conferred a significant 
defect in DNA binding. This residue may make contact with tx- 
helix 5 and either direcdy or indirectly affect ssDNA that may 
traverse the groove between this helix and the main body of the 
dimer, potentially allowing ssDNA to wrap around the toroid. 
Surprisingly, the Q45A mutant displayed only a minor defect in 
DNA binding activity compared to the wt, despite being highly 
conserved among the Orf family. It may play a more subde role as 
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the formation of protein:DNA complexes in gel shift assays showed 
some variation from the wt on both bubble and dsDNA structures. 

Mutation of Tip 137 to alanine resulted in a modest reduction in 
DNA binding, comparable to other substitution mutants in the Orf 
C-terminal helix (R132A, R140A and W141F) [13]. This entire 
helix is flexible and could adopt a variety of conformations to 
clamp onto DNA or associate with the RAGNYA domain as 
suggested previously [13]. The two remaining mutants, R130E 
and V106E, are located close to the cavity entrance. V106E 
showed a significant defect in DNA binding, while R103E was 
completely defective in binding any of the DNA substrates tested. 
The V106E mutant had a tendency to aggregate and thus it may 
serve in a structural role, stabilizing intrasubunit interactions 
between helices a.2 and oc4. Addition of a negatively-charged 
residue close to the rim of the channel could also disrupt 
electrostatic interactions between Orf and the DNA backbone. 
The R103E mutant could behave in a similar fashion as it lies at 
each end of the central channel and could be critical for stabilizing 
contacts with ssDNA. Argl03 in one of the Orf monomers forms a 
salt bridge with both Asp68 and Glu70 from the other subunit, 
however, the R103E mutant does appear to fold and dimerize 
normally suggesting that it does not significantly disrupt the dimer 
interface. If the Orf protein can open as a clamp, the R103E 
substitution could potentially affect opening and closing of the ring 
preventing its association with DNA. Argl03 and Vail 06 lie in a 
short motif that resembles part of the subunit interface in RecA 
[29,30] and this region protrudes on both sides of the Orf cavity, 
ideally situated to aid nucleation of RecA onto ssDNA. However, 
as yet, we have been unable to confirm a physical interaction 
between Orf and RecA. 

The assembly of Orf on ssDNA in vivo could facilitate loading 
of either the bacterial RecA strand exchange protein or phage /? 
strand annealing protein to stimulate recombination [1]. Assuming 
that Orf behaves similarly to recF, recO and recR, the genes it 
substitutes for in vivo [8,11], it could help nucleate RecA at 
replication forks, D-loops (similar to bubble structures) or at gaps 
resected in duplex DNA. This scenario would be analogous to 
bacteriophage T4, whereby the UvsY mediator promotes assem- 
bly of the UvsX recombinase (a RecA homolog) onto ssDNA, 
overcoming the block presented by the phage Gp32 ssDNA 
binding protein [39] . RecFOR are known to help RecA filaments 
assemble at dsDNA-ssDNA junctions [40-42], destabilizing SSB- 
ssDNA complexes that normally prevent access to the template 
[35]. The fact that RecFOR promote RecA filament assembly at 
gaps with either DNA or RNA at the 5 ' terminus suggests that the 
stalled replication fork may well be a target for their mediator 
function [43]. Facilitated loading of RecA in this context could 
sanction lesion bypass and replication restart downstream of the 
block. Restoration pathways that depend on fork regression have 
been suggested [44,45] and there is evidence that RecFOR play an 
important role [46] that can function independently of RuvAB- 
mediated Holliday junction processing [47]. If Orf could stimulate 
a similar recovery pathway, even by simply safeguarding the fork, 
this might explain why it can also partially substitute for the 
activities of RuvABC in Holliday junction branch migration and 
resolution [1 1]. 

Orf suppresses the recombination and DNA repair defects of 
recFOR mutants even in the absence of Reda/J [8,11]. However, 
recent experiments examining recombinational exchanges be- 
tween partially homologous partners on phages and prophages, 
suggest that Orf is required primarily for exchanges promoted by 
the Red system rather than by RecA [38]. It is possible that 
RecFOR normally prevail over Orf in mediating the loading of 
RecA onto ssDNA bound by SSB, whereas the bacterial activities 



are less active in situations involving the Red system (e.g. at DNA 
ends). Depending on the context and availability of recombination 
enzymes, it may be that Orf aids nucleation of either phage Red/? 
or bacterial RecA to promote recombinational exchange. Both 
RecA (strand invasion) and Red (annealing) pathways [48] can 
promote the recovery of viable phage genomes, although the latter 
may lead to more rearrangements at sites of limited sequence 
homology, thus fostering phage genetic diversity and mosaicism 
[38]. 

Materials and Methods 

Sequence analysis 

Proteins related to Orf were identified using the TBLASTN and 
PSI-BLAST algorithms [49] on the non-redundant database at the 
National Center for Biotechnology Information (NIH, Bethesda, 
MD). An inclusion threshold of 0.05 was applied in iterative 
searches to restrict those matches selected in the position-specific 
weight matrix (PSSM). Sequences were aligned using ClustalW 
with a Gap Creation Penalty of 10 and Gap Extension Penalty of 1 
[50]. Phylogenetic relationships were estimated by the neighbor- 
hood-joining method incorporated within the PAUP package [51]. 
The Jnet protein structure prediction tool on the Jpred server was 
used to estimate secondary structure features of E. coli DLP12 
YbcN and S. aureus <j>KTA Orf20 [52]. The Phyre 2 protein fold 
recognition server is an improved version of 3D-PSSM [53]. 

MBP and GST fusion constructs 

The orf-like gene (orfl51 or ybcN) from E. coli prophage 
DLP12 [18] was amplified from AB1 157 genomic DNA using Pfx 
DNA polymerase and oligonucleotides 5'-CGGAGG GAATT- 
CATGAACCTCTCAC-3' and 5 ' -GAC GATTTG- 

GATCCCTGTAGATGTG-3 ' . The PCR product was digested 
with EcoRI and BamHI (underlined) and inserted into pMALc2 to 
give pLBlOl. This construct expresses E. coli MBP (maltose 
binding protein) fused to the N-terminus of YbcN. An N-terminal 
GST (glutathione S-transferase) fusion was constructed by 
transferring the insert from pLBlOl, using EcoRI and Sail sites, 
into pGEX-6P-l (GE Healthcare) to generate pLB108 (GST- 
YbcN). 

The QuickChange (Stratagene) site-directed mutagenesis meth- 
od was used to generate Q45A (pFC256), K48A (pFC262), W50A 
(pFC259), R103E (pFC265), V106E (pFC267) and W137A 
(pEHlOl) mutant derivatives in pMALc2. The integrity of all 
cloned genes was confirmed by DNA sequencing. 

Proteins 

GST, GST-Orf, MBP, MBP-Orf, SSB, SSB113 and SSBAC10 
proteins were purified as described [13]. Purified SSBAl 15-144 
protein [27] was kindly provided by Robert Lloyd, University of 
Nottingham. MBP- YbcN was purified from 1 1 of DH5o: or BL2 1 
(DE3) Rosetta carrying pLBlOl grown in LB broth supplemented 
with 10 mil glucose and ampicillin at 150 u.g/ml and 37 ug/ml 
chloramphenicol as required. Expression was induced with 
0.5 mM IPTG when the culture reached an Agsonm of 0.5 and 
incubation continued for 3 hours at 37°C. Harvested cells were 
resuspended in column buffer (20 mM Tris-HCl pH7.4, 200 mM 
NaCl, 1 mM EDTA) and lysed by sonication. The cleared lysate 
was applied to a 4 ml amylose column (New England Biolabs) in 
column buffer and bound proteins eluted in the presence of 
10 mM maltose. Fractions containing MBP- YbcN were collected 
and dialyzed in 20 mM Tris-HCl pH 8.0, 1 mM EDTA, 0.5 mM 
DTT, 200 mM KC1, 50% (v/v) glycerol. Aliquots were stored at - 
80°C and a total of 70 mg of highly pure MBP- YbcN was 
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recovered at a concentration of 17.5 mg/ml. Additional stocks of 
MBP-YbcN were purified by both amylose and heparin agarose 
chromatography, with the majority of protein eluting from the 
latter matrix at 0.45 M NaCl. 

GST-YbcN was purified from 6 1 of BL21 Codon Plus cells 
carrying pLB108. Cells were grown at 25°C in LB broth 
containing ampicillin at 150 ug/ml and protein expression 
induced as before. Harvested cells were resuspended in PBS 
(140 mM NaCl, 2.7 mM KC1, 10 mM Na 2 HP0 4 , 1.8 mM 
KH 2 P0 4 pH 7.3) and lysed in a cell disruptor at 15-25 psi. The 
supernatant was applied to a 4 ml glutathione sepharose 4B 
column (GE Healthcare) equilibrated in the same buffer. Fusion 
proteins were eluted in 50 mM Tris-HCl, 10 mM reduced 
glutathione pH 8.0 and further purified by passage through 
heparin agarose and Q_ sepharose chromatography, collecting the 
flow-though at each stage. A total of 32 mg GST-YbcN (16.2 mg/ 
ml) was recovered. 

MBP-Orf mutant proteins were purified from 500 ml of BL21- 
AI carrying the appropriate pMALc2 construct as described [13]. 
MBP-Orf Q45 A, W50A, R103E and V106E were further purified 
by heparin-agarose chromatography after the amylose affinity 
purification step. 

Protein concentrations were determined in a NanoDrop 2000 
(Thermo Scientific) micro-volume spectrophotometer and by a 
modified Bradford assay (BioRad) using bovine serum albumin as 
a standard; amounts of Orf proteins are expressed as moles of 
dimer. Restriction endonucleases, T4 DNA ligase, T4 polynucle- 
otide kinase and Pfx DNA polymerase were obtained from 
Invitrogen. 

DNA substrates 

Oligonucleotides used in this study are listed in Table S4. The 
50 nt ssDNA substrate (SS 50 ) consisted of o-SS 50 , which was 
annealed to its complement (o-DS 50 ) to give a 50 bp dsDNA 
duplex (DS 5 o). The 60 nt ssDNA substrate (SS 60 ) consisted of o- 
SSeo, which was annealed to its complement (o-DS 60 ) to give a 
fully complementary 60 bp dsDNA duplex (DS 60 ). Centrally 
located mismatch (MM) or bubble (BB) structures of 1, 5, 13 and 
20 nt (MM G:G , MM T:G , BB 5 , BB 13 and BB 20 ) were made by 
annealing o-SS 60 with o-MM G:G , o-MM X:G , 0-BB5, o-BB 13 and o- 
BB 2 o, respectively. Bent DNA substrates containing 1 (BTi), 2 
(BT 2 ) or 3 (BT 3 ) adenine insertions were made by annealing o- 
SS 60 with o-BT,, o-BT 2 and 0-BT3, respectively. In gel shift 
assays, 0-SS50 or o-SS 6 o were labeled with [y P] ATP at the 5' 
end using T4 polynucleotide kinase. Labeled DNA was separated 
from unincorporated nucleotide using MicroBioSpin columns 
(BioRad). Annealed substrates were further purified by separation 
on 10% polyacrylamide gels in 90 mM Tris-borate, 2 mM EDTA. 
A 60 nt fluorescein 3' -end-labeled ssDNA equivalent in sequence 
to o-SS 60 was used in fluorescence anisotropy experiments, 
alongside DS 60 , MMx :G and BB 2() substrates generated by 
annealing the fluorescein-labeled strand to appropriate partner 
oligonucleodides. 

DNA binding assays 

Band shift assays (20 ul) with 50 and 60 nt 32 P-labelled DNA 
substrates were conducted in 50 mM Tris-HCl pH 8.0, 5 mM 
EDTA, 1 mM dithiothreitol, 5% glycerol, 100 ug/ml BSA. 
Samples were incubated on ice for 15 min prior to separation 
on 4% PAGE in 6.7 mM Tris-HCl pH 8.0, 3.3 mM sodium 
acetate, 2 mM EDTA at 160 V. EDTA was replaced with 1 mM 
MgCl 2 in both binding mixtures and gels to assess protein-DNA 
interactions in the presence of Mg 2+ ions. Gels were dried on filter 



paper and analyzed by autoradiography and phosphorimaging. 
Data were analyzed using ImageQuant and ImageJ software. 

DNA-protein interactions measured by fluorescence anisotropy 
utilized 3 '-fluorescein-labeled derivatives of SS 60 , DS fi0 , MMx :G 
and BB 20 in 100 mM Tris-HCl pH 7.0. Protein was titrated into 
1 ml of 10 nM DNA in a 1 ml two-way quartz cuvette (10 mm 
path length). Changes in anisotropy were monitored on a modified 
Cary Eclipse Fluorescence Spectrophotometer (Agilent Technol- 
ogies) fitted with polarizing filters with excitation at 494 nm and 
emission at 52 1 nm. Five replicates were taken with an averaging 
time of 20 sec. Data were analyzed using DynaFit 3 software to 
estimate the K D for MBP-YbcN, MBP-Orf wt and mutant proteins 
on SS 60 and BB 20 . 

Size exclusion chromatography 

The apparent molecular mass of proteins was estimated using 
an AKTA FPLC system with a 24 ml Superose 12 HR 10/30 
column (GE Healthcare). Proteins (1 mg/ml) were applied to the 
gel filtration column in 20 mM Tris-HCl pH 8.0, 1 mM EDTA, 
0.5 mM DTT, 150 mM KC1 at a flow rate of 0.5 ml/min. 
Molecular mass standards (BioRad) contained thyroglobulin 
(670 kDa), y-globulin (158 kDa), ovalbumin (44 kDa), myoglobin 
(17 kDa) and vitamin B12 (1.35 kDa). 

Far western blotting 

Purified SSB wt and mutant proteins were separated on 15% 
SDS-PAGE and transferred to a PVDF membrane by electro- 
blotting in 48 mM Tris, 39 mM glycine, 1.3 mM SDS. A 
prestained protein molecular weight standard (BioRad) served as 
a size marker. Blots were probed with YbcN or Orf fused to MBP 
or GST and interactions detected with monoclonal anti-MBP or 
anti-GST antibodies and mouse IgG peroxidase conjugate 
(Sigma). Chemiluminescence was observed by exposure to X-ray 
film following treatment with ECL reagents (GE Healthcare). 
Appropriate controls, either positive (appropriately tagged YbcN 
or Orf bound to the membrane) or negative (omitting tagged 
YbcN/ Orf or including purified MBP or GST), were analyzed in 
parallel. Protein transfer to membranes was monitored by staining 
with 1% amido black (Sigma) in 10% acetic acid. 

ELISA 

SSB wt and mutant proteins in PBS were applied to a flat 
bottom Immulon 2HB microtitre plate (Thermo Scientific) and 
incubated for 16 h at 37°C to allow binding. Wells were washed in 
PBS containing 0.3% Tween-20 (Sigma) and incubated for 1 hr at 
37°C in this buffer containing 5% milk powder. After washing, 
GST-Orf or GST-YbcN proteins were added and incubated for 
1 hr at 37°C. Wells were washed again prior to addition of HRP- 
anti-GST antibody (Sigma) at a 1:1000 dilution. Tetramethylben- 
zidine (Sigma) was added and the intensity of the color reaction 
measured at Ag^nm m an Anthos HTII plate reader. 

Static light scattering 

Proteins at 0.2 mg/ml (50 ul) in 100 mM Tris-HCl pH 7 at 
25°C were analyzed in a Zetasizer uV (Malvern Instruments Ltd). 
Thirteen measurements at 10 sec intervals were recorded and the 
data set repeated in triplicate for each sample. 

Circular dichroism (CD) spectroscopy 

Proteins were dialyzed overnight in Slide-A-Lyzer Mini Dialysis 
Units (Pierce) in ultrapure water at 4°C. Dialyzed samples were 
centrifuged at 15 000 rpm for 30 minutes to remove any 
precipitated protein. Protein concentrations were determined by 
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measurement of the absorbance at 280 nm. Far-UV CD spectra 
and the corresponding blanks were recorded on a Jasco J-8 1 0 
Spectropolarimeter in a 1 ml cuvette (0.2 cm path length). Eight 
accumulations recorded at a rate of 10 nm/min, a pitch of 
0.5 nm, a bandwidth of 1 nm and a response time of 2 s, were 
averaged. After subtraction of the appropriate blank, adaptive 
smoothing was carried out within the Jasco Spectra Analysis 
program (version 1.50, Jasco, Great Dunmow, UK). Smoothed 
data in the range 185-240 nm were analyzed for protein 
secondary structure using the CDSSTR, SELCON3, and CON- 
TIN/LL programs in the CDPro package [54] with the SP37 
reference set of soluble proteins [55]. Secondary structure content 
of known crystal structures was determined by DSSP [56] and 
accessed through the sequence retrieval system (SRS) at the 
European Bioinformatics Institute (http:/ /srs. ebi.ac.uk). 

Supporting Information 

Figure SI Phylogram of representative Orf family proteins. The 
tree was obtained by the neighborhood joining method using 
Staphylococcus aureus phage 187 and Listeria innocua sequences 
linl 258 and linl740 sequences as the outgroup. S. typhimurium 
phage ST64T and E. coli 0157:H7 phage 933W were excluded 
from the phylogenetic tree because they carry incomplete orf 
genes. Only branches with significant values (>70%) are shown. 
Abbreviations are: Eco, E. coli; Sso, Shigella sonnei; Plu, 
Photorhabdus luminescens; Sen, Salmonella enterica; Bbr, Borde- 
tella bronchiseptica; Nar, Novosphingobium aromaticivorans; 
Mma, Magnetospirillum magnetotacticum; Psy, Pseudomonas 
syringae; UPEC, Uropathogenic E. coli; Sty, Salmonella enterica 
serovar Typhimurium; Sfl, Shigella flexneri; Ype, Yersinia pestis; 
Ahy, Aeromonas hydrophila; Hin, Haemophilus influenzae; Aac, 
Actinobacillus actinomycetemcomitans; Ngo, Neisseria gonorrhoeae; 
Bvi, Burkholderia vietnamiensis; Lin, Listeria innocua; Sau, 
Staphylococcus aureus. Four groups of closely-related homologs 
are highlighted in color. The true phylogeny is difficult to ascertain 
due to the likelihood of gene transfer between phage genomes. Full 
designations of sources are given in Table SI. 
(TIFF) 

Figure S2 Structural similarity between 1 Orf, DLP12 YbcN 
and ^ETA Orf20 proteins. Structural models of E. coli DLP12 
YbcN (residues 16-147), belonging to the PRK09741 domain, and 
S. aureus ^ETA Orf20 (residues 26-130), a member of the 
DUF968 domain, were based on the X Orf crystal structure (1PC6) 
using Phyre 2 . Only subunit A is shown, with the RAGNYA 
domain colored in yellow. Conserved and potential functionally- 
equivalent residues located close to the central channel of the 
dimer are highlighted. 
(TIFF) 

Figure S3 YbcN binding to single-stranded and bubble DNA. 
Comparison of MBP-YbcN binding to 10 nM fluorescein-labeled 
SS 60 and BB 2 o as determined by fluorescence anisotropy. Data are 
the mean and standard deviation of two independent experiments. 
(TIFF) 

Figure S4 YbcN and Orf binding to mismatch and bent DNA. 
(A) Comparison of MBP-YbcN and MBP-Orf binding to 10 nM 
fluorescein-labeled MM G:G and MM X:G DNA as determined by 
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fluorescence anisotropy. Data are the mean and standard 
deviation of two independent experiments. (B) Comparison of 
MBP-YbcN and MBP-Orf binding to bent DNA. Gel mobility 
shift assays contained 125 nM MBP-Orf (O) or MBP-YbcN (Y) 
proteins, 5 mM EDTA and 0.15 nM of :52 P-labelled 60 nt (SS 60 ) 
ssDNA (lanes a-c), 60 bp (DS 60 ) dsDNA (lanes d-f), 1 nt (BT,) 
insertion (lanes g-i), 2 nt (BT 2 ) insertion (lanes j-1) and 3 nt (BT 3 ) 
insertion (lanes m-o). 
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Figure S5 Analysis of DLP12 MBP-YbcN and GST- YbcN 
proteins. (A) CD analysis of YbcN proteins and binding to ssDNA. 
CD spectra (180-260 nm) were obtained for MBP, GST, MBP- 
YbcN and GST- YbcN proteins in ultrapure water at 20°C. Gel 
shift assays contained 0.3 nM 32 P-labelled ssDNA (SS 50 ), 5 mM 
EDTA and 62.5, 125 and 250 nM MBP-Orf (lanes b-d) and 250, 
500 and 1000 nM MBP-YbcN (lanes f-h) and GST- YbcN (lanes j- 
1). (B) Size-exclusion chromatography of DLP12 YbcN. MBP- 
YbcN and GST- YbcN proteins (1 mg/ml) were applied to a 24 ml 
Superose 6HR 10/30 column in 20 mM Tris-HCl pH8, 1 mM 
EDTA, 0.5 mM DTT, 250 mM KC1. The predicted molecular 
weights for each protein monomer are 60.4 kDa for MBP-YbcN 
and 44.6 kDa for GST- YbcN. Oligomeric states are depicted with 
a circle representing a single subunit and placed adjacent to the 
corresponding peak. Boiled (+) and unboiled (-) samples of each 
purified protein were separated on 12.5% SDS-PAGE and stained 
with Coomassie blue. 
(TIFF) 

Figure S6 MBP-Orf mutant protein binding to SSB in far 
western assays. SSB protein (5 |J,g) separated on 15% SDS-PAGE 
was blotted and probed with 30 u.g MBP-Orf mutant proteins. 
MBP-Orf-SSB interactions were detected with antibodies specific 
for the MBP domain. 
(TIFF) 
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